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IN THE CLAIMS: 

Please amend claims 9, 17, 26, 27, and 51 as follows. 

1 . (previously presented) An apparatus for converting an input voice signal 
into an output voice signal according to a reference voice signal, the apparatus 
comprising: 

extracting means for extracting only deterministic components from the input 
voice signal, the deterministic components including a plurality of sinusoidal wave 
components which are numbered sequentially, wherein the input voice signal includes 
the deterministic components and residual components; 

separating means for separating the sinusoidal wave components into frequency 
value coordinates and amplitude value coordinates which are numbered sequentially in 
a manner the same as the sinusoidal wave components; 

memory means for storing reference pitch information representative of a pitch of 
the reference voice signal, the pitch information including primary pitch information 
representative of a discrete pitch matching a music scale and secondary pitch 
information representative of a fractional pitch fluctuating relative to the discrete pitch, 
and storing reference amplitude information representative of reference amplitude value 
coordinates, which are numbered sequentially, of the sinusoidal wave components 
contained in the reference voice signal; 

first modulating means for modulating the frequency value coordinates of the 
sinusoidal wave components of the input voice signal according to the primary 
reference pitch information retrieved from the memory means, to generate modulated 
frequency value coordinates, the first modulating means further modulating the 
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modulated frequency value coordinates of the sinusoidal wave components of the input 
voice signal according to the secondary reference pitch information retrieved from the 
memory means, to generate further modulated frequency value coordinates; 

control means for setting control parameters effective to control degrees of the 
modulation of the frequency value coordinates by the primary reference pitch 
information and the secondary pitch information, respectively, so that a degree of 
influence of the pitch of the reference voice signal to a pitch of the output voice signal is 
determined according to the control parameters; 

second modulating means for modulating the amplitude value coordinates of the 
sinusoidal wave components of the input voice signal according to the reference 
amplitude information representative of the reference amplitude value coordinates 
which are numbered correspondingly to the amplitude value coordinates of the input 
voice signal, retrieved from the memory means, such that each amplitude value 
coordinate of the input voice signal is mixed with the corresponding reference amplitude 
value coordinate by a set ratio; 

combining means for combining each of the modulated frequency value 
coordinates and each of the further modulated amplitude value coordinates to 
synthesize sinusoidal wave components of the output voice signal having an output 
pitch and an output timbre different from an input pitch and an input timbre of the input 
voice signal, and influenced by a reference pitch and a reference timbre of the 
reference voice signal; and 

mixing means for mixing the synthesized sinusoidal wave components having 
the modulated frequency value coordinates to synthesize the output voice signal having 
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a pitch different from that of the input voice signal and influenced by the pitch of the 
reference voice signal. 

2. (Original) The apparatus as claimed in claim 1 , further comprising 
control means for setting a control parameter effective to control a degree of modulation 
of the frequency of each sinusoidal wave component by the modulating means so that 
a degree of influence of the pitch of the reference voice signal to the pitch of the output 
voice signal is determined according to the control parameter. 

Claim 3. (cancelled). 

4. (Original) The apparatus as claimed in claim 1 , further comprising 
detecting means for detecting a pitch of the input voice signal based on results of 
extraction of the sinusoidal wave components, and switch means operative when the 
detecting means does not detect the pitch from the input voice signal for outputting an 
original of the input voice signal in place of the synthesized output voice signal. 

5. (previously presented) The apparatus as claimed in claim 1 , wherein the 
mixing means mixes the plurality of the sinusoidal wave components having the 
modulated amplitudes to synthesize the output voice signal having a timbre different 
from that of the input voice signal and influenced by the timbre of the reference voice 
signal. 

6. (Original) The apparatus as claimed in claim 5, further comprising 
means for setting a control parameter effective to control a degree of modulation of the 
amplitude of each sinusoidal wave component by the modulating means so that a 
degree of influence of the timbre of the reference voice signal to the timbre of the 
output voice signal is determined according to the control parameter. 



600143395V1 



4 



PATENT 
51270-245583 

7. (Original) The apparatus as claimed in claim 1 , further comprising 
means for memorizing volume information representative of a volume variation of the 
reference voice signal, and means for varying a volume of the output voice signal 
according to the volume information so that the output voice signal emulates the 
volume variation of the reference voice signal. 

8. (Original) The apparatus as claimed in claim 1 , further comprising 
means for separating a residual component from the input voice signal after extraction 
of the sinusoidal wave components, and means for adding the residual component to 
the output voice signal. 

9. (currently amended) An apparatus for converting an input voice signal 
into an output voice signal according to a reference voice signal, the apparatus 
comprising: 

extracting means for extracting only deterministic components from the input 
voice signal, the deterministic components including a plurality of sinusoidal wave 
components which are numbered sequentially, wherein the input voice signal includes 
the deterministic components and residual components; 

separating means for separating the sinusoidal wave components into frequency 
value coordinates and amplitude value coordinates ASn' (n = 1 , 2, 3, ...); 

memory means for storing, as memorized amplitude value coordinates, 
reference amplitude information representative of reference amplitude value 
coordinates ATn (n = 1 , 2, 3, . . .), which are numbered sequentially, of the sinusoidal 
wave components contained in the reference voice signal; 

modulating means for modulating the amplitude value coordinates ASn' of the 
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sinusoidal wave components of the input voice signal extracted from the input voice 
signal according to the reference amplitude information representative of the reference 
amplitude value coordinates ATn, which are numbered correspondingly to the 
amplitude value coordinates of the input voice signal, and retrieved from the memory 
means by the following calculation (1 - [[y]] y) * ASn' + [[y]] y * ATn (n = 1 , 2, 3, . . .), 
where the parameter [[y]] y takes a value from zero to one and represents a degree of 
mixing ; and 

mixing means for mixing the plurality of the sinusoidal wave components having 
the modulated amplitude value coordinates to synthesize the output voice signal having 
a timbre different from that of the input voice signal and influenced by the timbre of the 
reference voice signal, 

wherein the modulating means comprises 

normalizing means for normalizing the amplitude value coordinates of the 

sinusoidal wave components of the input voice signal by a mean amplitude of 

the input voice signal, to generate normalized amplitude value coordinates, 
a second mixing means for mixing the normalized amplitude value 

coordinates of the input voice signal and the memorized amplitude value 

coordinates of the reference voice signal with one another by a predetermined 

ratio to produce mixed amplitude value coordinates, and 

multiplying means for multiplying the normalized amplitude value 

coordinates of the sinusoidal wave components of the input voice signal with the 

mean amplitude of the input voice signal. 

10. (Original) The apparatus as claimed in claim 9, further comprising 
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control means for setting a control parameter effective to control a degree of modulation 
of the amplitude of each sinusoidal wave component by the modulating means so that a 
degree of influence of the timbre of the reference voice signal to the timbre of the 
output voice signal is determined according to the control parameter. 

1 1 . (previously presented) The apparatus as claimed in claim 9, wherein the 
memory means further stores pitch information representative of a pitch of the 
reference voice signal, and the modulating means further modulates a frequency of 
each sinusoidal wave component of the input voice signal according to the pitch 
information, so that the mixing means mixes the plurality of the sinusoidal wave 
components having the modulated frequencies to synthesize the output voice signal 
having a pitch different from that of the input voice signal and influenced by the pitch of 
the reference voice signal. 

12. (Original) The apparatus as claimed in claim 1 1 , further comprising 
means for setting a control parameter effective to control a degree of modulation of the 
frequency of each sinusoidal wave component by the modulating means so that a 
degree of influence of the pitch of the reference voice signal to the pitch of the output 
voice signal is determined according to the control parameter. 

Claim 13 (cancelled). 

14. (Original) The apparatus as claimed in claim 9, further comprising 
detecting means for detecting a pitch of the input voice signal based on results of 
extraction of the sinusoidal wave components, and switch means operative when the 
detecting means does not detect the pitch from the input voice signal for outputting an 
original of the input voice signal in place of the synthesized output voice signal. 
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1 5. (Original) The apparatus as claimed in claim 9, further comprising 
means for memorizing volume information representative of a volume variation of the 
reference voice signal, and means for varying a volume of the output voice signal 
according to the volume information so that the output voice signal emulates the 
volume variation of the reference voice signal. 

16. (Original) The apparatus as claimed in claim 9, further comprising 
means for separating a residual component from the input voice signal after extraction 
of the sinusoidal wave components, and means for adding the residual component to 
the output voice signal. 

1 7. (currently amended) An apparatus for synthesizing an output voice signal 
from an input voice signal and a reference voice signal, the apparatus comprising: 

an analyzer device that analyzes only deterministic components contained in the 
input voice signal to derive a parameter set of an original frequency and an original 
amplitude, the deterministic components including a plurality of sinusoidal wave 
components which are numbered sequentially, wherein the input voice signal includes 
the deterministic components and residual components; 

a separating device to separate the sinusoidal wave components into frequency 
value coordinates and amplitude value coordinates ASn' (n = 1, 2, 3, ...), which are 
numbered sequentially in a manner the same as the sinusoidal wave components; 

a source device that provides reference information characteristic of the 
reference voice signal, the reference information being reference amplitude information 
representative of reference amplitude value coordinates ATn (n= 1, 2, 3, ...), which are 
numbered sequentially; 
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a modulator device that modulates the parameter set of the sinusoidal wave , 
components according to the reference information; 

a regenerator device that operates according to each of the parameter sets as 
modulated to regenerate each of the sinusoidal wave components so that at least one 
of the frequency and the amplitude of each sinusoidal wave component as regenerated 
varies from the original one, and that mixes the regenerated sinusoidal wave 
components together to synthesize the output voice signal; 

a second modulator device to modulate the amplitude value coordinates ASn' of 
the sinusoidal wave components of the input voice signal according to reference 
amplitude information, representative of amplitudes of the sinusoidal wave components 
contained in the reference voice signal ATn which are numbered correspondingly to the 
amplitude value coordinates of the input voice signal, to generate modulated amplitude 
value coordinates by utilizing the following calculation (1 - [[y]] y) * ASn' + [[y]] y * ATn 
(n = 1 , 2, 3, . . .), where the parameter [[y]] y takes a value from zero to one and 
represents a degree of mixing ; 

a combining device to combine the modulated frequency value coordinates and 
the modulated amplitude value coordinates to synthesize sinusoidal wave components 
of the output voice signal having an output pitch and an output timbre different from an 
input pitch and an input timbre of the input voice signal, and influenced by a reference 
pitch and a reference timbre of the reference voice signal. 

18. (previously presented) The apparatus as claimed in claim 17, wherein the 
source device provides the reference information characteristic of a pitch of the 
re f erence voice signal, and wherein the modulator device modulates the parameter set 
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of each sinusoidal wave component according to the reference information so that the 
frequency of each sinusoidal wave component as regenerated varies from the original 
frequency, the pitch of the output voice signal being synthesized according to the pitch 
of the reference voice signal. 

19. (cancelled). 

20. (previously presented) The apparatus as claimed in claim 17, wherein the 
source device provides the reference information characteristic of a timbre of the 
reference voice signal, and wherein the modulator device modulates the parameter set 
of each sinusoidal wave component according to the reference information so that the 
amplitude of each sinusoidal wave component as regenerated varies from the original 
amplitude, the timbre of the output voice signal being synthesized according to the 
timbre of the reference voice signal. 

21. (Original) The apparatus as claimed in claim 17, further comprising a 
control device that provides a control parameter effective to control the modulator 
device so that a degree of modulation of the parameter set is variably determined 
according to the control parameter. 

22. (Original) The apparatus as claimed in claim 1 7, further comprising a 
detector device that detects a pitch of the input voice signal based on analysis of the 
sinusoidal wave components by the analyzer device, and a switch device operative 
when the detector device does not detect the pitch from the input voice signal for 
outputting an original of the input voice signal in place of the synthesized output voice 
signal. 

23. (previously presented) The apparatus as claimed in claim 17, further 
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comprising a memory device that stores volume information representative of a volume 
variation of the reference voice signal, and a volume device that varies a volume of the 
output voice signal according to the volume information so that the output voice signal 
emulates the volume variation of the reference voice signal. 

24. (Original) The apparatus as claimed in claim 17, further comprising a 
separator device that separates a residual component other than the sinusoidal wave 
components from the input voice signal, and an adder device that adds the residual 
component to the output voice signal. 

25. (previously presented) A method of converting an input voice signal into 
an output voice signal according to a reference voice signal, the method comprising the 
steps of: 

extracting only deterministic components from the input voice signal, the 
deterministic components including a plurality of sinusoidal wave components which are 
numbered sequentially, wherein the input voice signal includes the deterministic 
components and residual components; 

separating the sinusoidal wave components into frequency value coordinates 
and amplitude value coordinates, which are numbered sequentially in a manner the 
same as the sinusoidal wave components; 

storing reference pitch information representative of a pitch of the reference 
voice signal, the pitch information including primary pitch information representative of a 
discrete pitch matching a music scale and secondary pitch information representative of 
a fractional pitch fluctuating relative to the discrete pitch, and storing reference 
amplitude information representative of reference amplitude value coordinates, which 
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are numbered sequentially, of the sinusoidal wave components contained in the 
reference voice signal; 

modulating the frequency value coordinates of the sinusoidal wave components 
of the input voice signal according to the primary reference pitch information, to 
generate modulated amplitude value coordinates, and further modulating the modulated 
frequency value coordinates of the sinusoidal wave components of the input voice 
signal according to the secondary reference pitch information retrieved from the 
memory means, to generate further modulated frequency value coordinates; 

setting control parameters effective to control degrees of modulation of the 
frequency value coordinates by the primary reference pitch information and the 
secondary pitch information, respectively, so that a degree of influence of the pitch of 
the reference signal to a pitch of the output voice signal is determined according to the 
control parameters; 

mixing the plurality of the sinusoidal wave components having the modulated 
frequency value coordinates to synthesize the output voice signal having a pitch 
different from that of the input voice signal and influenced by that of the reference voice 
signal; 

modulating the amplitude value coordinates of the sinusoidal wave components 
of the input voice signal according to the reference amplitude information representative 
of the reference amplitude value coordinates which are numbered correspondingly to 
the amplitude value coordinates of the input voice signal, retrieved from the memory 
means such that each amplitude value coordinate of the input voice signal is mixed with 
the corresponding reference amplitude value coordinate by a set ratio; and 
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combining the modulated frequency value coordinates and the modulated 
amplitude value coordinates to synthesize sinusoidal wave components of the output 
voice signal having an output pitch and an output timbre different from an input pitch 
and an input timbre of the input voice signal, and influenced by a reference pitch and a 
reference timbre of the reference voice signal. 

26. (currently amended) A method of converting an input voice signal into an 
output voice signal according to a reference voice signal, the method comprising the 
steps of: 

extracting only deterministic components from the input voice signal, the 
deterministic components including a plurality of sinusoidal wave components which are 
numbered sequentially, wherein the input voice signal includes the deterministic 
components and residual components; 

separating the sinusoidal wave components into frequency value coordinates 
and amplitude value coordinates ASn' (n = 1, 2, 3, . . .); 

storing, as stored amplitude value coordinates, reference amplitude information 
representative of reference amplitude value coordinates ATn (n = 1, 2, 3, . . .), which 
are numbered sequentially in a manner the same as the sinusoidal wave components, 
of the sinusoidal wave components contained in the reference voice signal; 

modulating the amplitude value coordinates ASn' of the sinusoidal wave 
components of the input voice signal extracted from the input voice signal according to 
the reference amplitude information representative of the reference amplitude value 
coordinates ATn, by the following calculation (1 - [[y]] y) * ASn' + [[y]] y * ATn (n = 1 , 2, 
3, . . .), where the parameter [[y]] y takes a value from zero to one and represents a 
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degree of mixing , which are numbered correspondingly to the amplitude value 
coordinates of the input voice signal such that each amplitude value coordinate of the 
input voice signal is mixed with the corresponding reference amplitude coordinate by a 
set ratio, retrieved from the memory means; and 

mixing the plurality of the sinusoidal wave components having the modulated 
amplitude value coordinates to synthesize the output vice signal having a timbre 
different from that of the input voice signal and influenced by the timbre of the reference 
voice signal; 

normalizing the amplitude value coordinates of the sinusoidal wave components 
of the input voice signal by a mean amplitude of the input voice signal, to generate 
normalized amplitude value coordinates; 

mixing the normalized amplitude value coordinates of the input voice signal and 
the stored amplitude value coordinates of the reference voice signal with one another 
by a predetermined ratio to produce mixed amplitude value coordinates; and 

multiplying the normalized amplitude value coordinates of the sinusoidal wave 
components of the input voice signal with the mean amplitude of the input voice signal. 

27. (currently amended) A machine readable medium used in a computer 
machine having a CPU for synthesizing an output voice signal from an input voice 
signal, the medium containing program instructions executed by the CPU for causing 
the computer machine to perform the method comprising the steps of: 

analyzing only deterministic components contained in the input voice signal to 
derive a parameter set of an original frequency and an original amplitude, the 
deterministic components including a plurality of sinusoidal wave components which are 
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numbered sequentially, wherein the input voice signal includes the deterministic 
components and residual components; 

separating means for separating the sinusoidal wave components into frequency 
value coordinates and amplitude value coordinates ASn' (n = 1, 2, 3, . ..); 

providing reference information characteristic of the reference voice signal, 
including reference amplitude information representative of amplitude value coordinates 

ATn (n = 1,2,3,...); 

modulating the amplitude value coordinates ASn' according to the reference 
amplitude information representative of the amplitude value coordinates ATn by the 
following calculation (1 - [[y]] y) * ASn' + [[y]] y * ATn (n = 1, 2, 3, . . .)■ where the 
parameter [[y]] y takes a value from zero to one and represents a degree of mixing , to 
generate modulated amplitude value coordinates; 

regenerating each of the sinusoidal wave components according to each of the 
modulated parameter sets so that at least one of the frequency and the amplitude of 
each regenerated sinusoidal wave components varies from the original one, and 

mixing the regenerated sinusoidal wave components together to synthesize the 
output voice signal; 

separating the sinusoidal wave components into frequency value coordinates 
and amplitude value coordinates, which are numbered sequentially in a manner the 
same as the sinusoidal wave components; 

modulating the amplitude value coordinates of the sinusoidal wave components 
of the input voice signal according to the reference amplitude information, 
representative of reference amplitude value coordinates which are numbered 
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correspondingly to the amplitude value coordinates of the input voice signal such that 
each amplitude value coordinate of the input voice signal is mixed with the 
corresponding reference amplitude value coordinate by a set ratio, of the sinusoidal 
wave components contained in the reference voice signal, to generate modulated 
amplitude value coordinates; and 

combining the modulated frequency value coordinates and the modulated 
amplitude value coordinates to synthesize sinusoidal wave components of the output 
voice signal having an output pitch and an output timbre different from an input pitch 
and an input timbre of the input voice signal, and influenced by a reference pitch and a 
reference timbre of the reference voice signal, 

28. (Original) The apparatus as claimed in claim 17, wherein the 
parameter set is in the form of a plurality of frequency value and amplitude value 
coordinates, the frequency value coordinates representing the original frequency and 
the amplitude value coordinates representing the original amplitude. 

29. (Original) The machine readable medium as claimed in claim 27, 
wherein the parameter set is in the form of a plurality of frequency value and amplitude 
value coordinates, the frequency value coordinates representing the original frequency 
and the amplitude value coordinates representing the original amplitude. 

30. (Original) The apparatus as claimed in claim 1 , wherein the extracting 
means utilizes Fast Fourier Transform and a peak detecting means to extract the 
plurality of sinusoidal components from the input voice signal, the Fast Fourier 
Transform being carried in prescribed frame units to create a frequency spectrum 
successively for each frame of the input voice signal, the peak detecting means 



600143395v1 



16 



PATENT 
51270-245583 

detecting peaks in the frequency spectrum to extract the frequency value coordinates. 

31 . (Original) The apparatus as claimed in claim 9, wherein the extracting 
means utilizes Fast Fourier Transform and a peak detecting means to extract the 
plurality of sinusoidal components from the input voice signal, the Fast Fourier 
Transform being carried in prescribed frame units to create a frequency spectrum 
successively for each frame of the input voice signal, the peak detecting means 
detecting peaks in the frequency spectrum to extract the amplitude value coordinates. 

32. (Original) The apparatus as claimed in claim 1 7, wherein the analyzer 
device utilizes Fast Fourier Transform and a peak detecting means to derive the 
parameter set representing the corresponding sinusoidal wave component, the Fast 
Fourier Transform being carried in prescribed frame units to create a frequency 
spectrum successively for each frame of the input voice signal, the peak detecting 
means detecting peaks in the frequency spectrum to extract the parameter set. 

33. (Original) The method as claimed in claim 25, wherein the extracting 
step involves utilizing Fast Fourier Transform and peak detection to extract the plurality 
of sinusoidal components from the input voice signal, the Fast Fourier Transform being 
carried in prescribed frame units to create a frequency spectrum successively for each 
frame of the input voice signal, the peak detection detecting peaks in the frequency 
spectrum to extract the frequency value coordinates. 

34. (Original) The method as claimed in claim 26, wherein the extracting 
step involves utilizing Fast Fourier Transform and peak detection to extract the plurality 
of sinusoidal components from the input voice signal, the Fast Fourier Transform being 
carried in prescribed frame units to create a frequency spectrum successively for each 
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frame of the input voice signal, the peak detection detecting peaks in the frequency 
spectrum to extract the amplitude value coordinates. 

35. (Original) The machine readable medium as claimed in claim 27, 
wherein the analyzing step involves utilizing Fast Fourier Transform and peak detection 
to derive the parameter set representing the corresponding sinusoidal wave 
component, the Fast Fourier Transform being carried in prescribed frame units to 
create a frequency spectrum successively for each frame of the input voice signal, the 
peak detection detecting peaks in the frequency spectrum to extract the parameter set. 

36. (Original) The apparatus according to claim 1 , wherein the 
deterministic components include peak values of the input voice signal in a frequency 
spectrum. 

37. (Original) The apparatus according to claim 1 , wherein the residual 
components include deviation components between a synthetic voice signal and the 
input voice signal. 

38. (Original) The apparatus according to claim 9, wherein the 
deterministic components include peak values of the input voice signal in a frequency 
spectrum. 

39. (Original) The apparatus according to claim 9, wherein the residual 
components include deviation components between a synthetic voice signal and the 
input voice signal. 

40. (Original) The apparatus according to claim 1 7, wherein the 
deterministic components include peak values of the input voice signal in a frequency 
spectrum. 



600143395v1 



18 



PATENT 
51270-245583 

41 . (Original) The apparatus according to claim 17, wherein the residual 
components include deviation components between a synthetic voice signal and the 
input voice signal. 

42. (Original) The method according to claim 25, wherein the deterministic 
components include peak values of the input voice signal in a frequency spectrum. 

43. (Original) The method according to claim 25, wherein the residual 
components include deviation components between a synthetic voice signal and the 
input voice signal. 

44. (Original) The method according to claim 26, wherein the deterministic 
components include peak values of the input voice signal in a frequency spectrum. 

45. (Original) The method according to claim 26, wherein the residual 
components include deviation components between a synthetic voice signal and the 
input voice signal. 

46. (Original) The machine-readable medium according to claim 27, 
wherein the deterministic components include peak values of the input voice signal in a 
frequency spectrum. 

47. (Original) The machine-readable medium according to claim 27, wherein 
the residual components include deviation components between a synthetic voice 
signal and the input voice signal. 

Claims 48 and 49 (cancelled). 

50. (previously presented) An apparatus for converting an input voice signal 
into an output voice signal according to a reference voice signal, the apparatus 
comprising: 
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extracting means for extracting only deterministic components from the input 
- voice signal, the deterministic components including a plurality of sinusoidal wave 
components which are numbered sequentially, wherein the input voice signal includes 
the deterministic components and residual components; 

separating means for separating the sinusoidal wave components into frequency 
value coordinates and amplitude value coordinates which are numbered sequentially in 
a manner the same as the sinusoidal wave components; 

memory means for storing reference pitch information representative of a pitch of 
the reference voice signal, and reference amplitude information representative of 
reference amplitude value coordinates, which are numbered sequentially, of the 
sinusoidal wave components contained in the reference voice signal; 

first modulating means for modulating the frequency value coordinates of the 
sinusoidal wave components of the input voice signal according to the reference pitch 
information retrieved from the memory means, to generate modulated frequency value 
coordinates; 

second modulating means for modulating the amplitude value coordinates of the 
sinusoidal wave components of the input voice signal according to the reference 
amplitude information representative of the reference amplitude value coordinates 
which are numbered correspondingly to the amplitude value coordinates of the input 
voice signal, retrieved from the memory means, such that each amplitude value 
coordinate of the input voice signal is mixed with the corresponding reference amplitude 
value coordinate by a set ratio; 

combining means for combining each of.the modulated frequency value 
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coordinates and each of the modulated amplitude value coordinates, which are 
processed separately from each other and which are numbered correspondingly to 
each other, to synthesize sinusoidal wave components of the output voice signal having 
an output pitch and an output timbre different from an input pitch and an input timbre of 
the input voice signal, and influenced by a reference pitch and a reference timbre of the 
reference voice signal; and 

mixing means for mixing the synthesized sinusoidal wave components having 
the modulated frequency value coordinates to synthesize the output voice signal having 
a pitch different from that of the input voice signal and influenced by the pitch of the 
reference voice signal. 

51 . (currently amended) An apparatus for converting an input voice signal into 
an output voice signal according to a reference voice signal, the apparatus comprising: 

extracting means for extracting only deterministic components from the input 
voice signal, the deterministic components including a plurality of sinusoidal wave 
components which are numbered sequentially, wherein the input voice signal includes 
the deterministic components and residual components; 

separating means for separating the sinusoidal wave components into frequency 
value coordinates and amplitude value coordinates ASn' (n=1, 2, 3, ...); 

memory means for storing reference pitch information representative of a pitch of 
the reference voice signal, and reference amplitude information representative of 
reference amplitude value coordinates ATn (n = 1, 2, 3, ...) of the sinusoidal wave 
components contained in the reference voice signal; 

first modulating means for modulating the frequency value coordinates of the 
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sinusoidal wave components of the input voice signal according to the reference pitch 
information retrieved from the memory means, to generate modulated frequency value 
coordinates; 

second modulating means for modulating the amplitude value coordinates ASn' 
of the sinusoidal wave components of the input voice signal according to the reference 
amplitude information representative of the reference amplitude value coordinates 
representative of the amplitude value coordinates ATn retrieved from the memory 
means by the following calculation (1 - [[y]] y_)*ASn' + [[y]] y_*ATn (n = 1 , 2, 3, ...), where 
the parameter [[y]] y_ takes a value from zero to one and represents a degree of mixing ; 

combining means for combining each of the modulated frequency value 
coordinates and each of the modulated amplitude value coordinates, which are 
processed separately from each other and which are numbered correspondingly to 
each other, to synthesize sinusoidal wave components of the output voice signal having 
an output pitch and an output timbre different from an input pitch and an input timbre of 
the input voice signal, and influenced by a reference pitch and a reference timbre of the 
reference voice signal; and 

mixing means for mixing the synthesized sinusoidal wave components having 
the modulated frequency value coordinates to synthesize the output voice signal having 
a pitch different from that of the input voice signal and influenced by the pitch of the 
reference voice signal. 
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